Evaluation of Automatically Reformulated Questions in Question Series

نویسندگان

  • Richard Shaw
  • Ben Solway
  • Robert J. Gaizauskas
  • Mark A. Greenwood
چکیده

Having gold standards allows us to evaluate new methods and approaches against a common benchmark. In this paper we describe a set of gold standard question reformulations and associated reformulation guidelines that we have created to support research into automatic interpretation of questions in TREC question series, where questions may refer anaphorically to the target of the series or to answers to previous questions. We also assess various string comparison metrics for their utility as evaluation measures of the proximity of an automated system’s reformulations to the gold standard. Finally we show how we have used this approach to assess the question processing capability of our own QA system and to pinpoint areas for improvement.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Quantitative and Qualitative Evaluation of Four Choice Questions In Shahrood University of Medical Sciences During 2021-2022

Background and Objective: Any test as a measurement tool must have sufficient validity and reliability to measure the desired attribute. Multiple-choice tests are the most common types of tests in medical education, which have a high degree of reliability, and this study was conducted with the aim of quantitative and qualitative evaluation of four-choice questions in Shahrood University of M...

متن کامل

Determination of the most important factors in overall effectiveness of a clinical teacher: students’ point of view

Introduction. Development of a faculty evaluation program determines the unique values and priorities of an education institute. Preparation of evaluation questionnaires on the basis of responders’ ideals and recognition, accuracy of analyses and practicality of assessment system are effective strategies for success. Elimination of inappropriate or parallel questions will shorten the questionna...

متن کامل

Investigating the Academic Achievement Evaluation of Specialized Theoretical Courses of Midwifery BS

Introduction: Evaluating the gap between educational goals and achievement is among the constant requirements of educational process. Using a well-developed test for academic achievement reflecting all educational goals and full syllabus content is a matter of importance. Regarding the magnitude of specialized theoretical courses of midwifery, researchers in this study attempted to assess the e...

متن کامل

Boosting Passage Retrieval through Reuse in Question Answering

Question Answering (QA) is an emerging important field in Information Retrieval. In a QA system the archive of previous questions asked from the system makes a collection full of useful factual nuggets. This paper makes an initial attempt to investigate the reuse of facts contained in the archive of previous questions to help and gain performance in answering future related factoid questions. I...

متن کامل

Assessment of Qualitative and Quantitative Indexes of Clerkship Tests in General Medicine

Introduction: Using multiple choice question tests, as an objective testing method, is the most common students evaluation procedure , and it is very important to design these tests properly.This study aimed to assess clerkship tests of general medicine courses in the training group of dermatology, psychiatry, gynecology, ophthalmology and neurology of medical college of Isfahan University of M...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008